150 research outputs found

    DISKRITISASI EQUAL-WIDTH INTERVAL PADA NAÏVE BAYES (STUDI KASUS: KLASIFIKASI PASIEN TBC)

    Get PDF
    Klasifikasi Naive Bayes merupakan teknik untuk memprediksi probabilitas keanggotaan suatu kelas dengan menerapkan teorema Bayes. Klasifikasi Naive Bayes akan lebih baik jika menggunakan data yang berbentuk kategorik, sehingga dalam penelitian ini digunakan diskritisasi equal-width interval pada Naive Bayes untuk mengubah data yang berbentuk numerik menjadi kategorik. Tujuan dari penelitian ini adalah untuk menerapkan metode Naive Bayes dengan diskritisasi equal-width interval dalam mengklasifikasi pasien TBC di Puskesmas Sewon 1. Hasil penelitian ini menunjukkan akurasi sebesar 100% dengan perbandingan data training dan data testing sebesar 80%:20% dan 90%:10%, sehingga klasifikasi Naive Bayes dapat dikategorikan baik dalam mengklasifikasi pasien TBC

    Distance Functions Study in Fuzzy C-Means Core and Reduct Clustering

    Get PDF
    Fuzzy C-Means is a distance-based clustering process which applied by fuzzy logic concept. Clustering process worked in linear to the iteration process to minimizing the objective function. The objective function is an addition of the multiplication between the coordinates distance towards their closest cluster centroid and their membership degree. The more the iteration process, the objective function should get lower and lower. The objective of this research is to observe whether the distances which usually applied are able to fulfill the aforementioned hypothesis for determining the most suitable distance for Fuzzy C-Means clustering application. Few distance function was applied in the same dataset. 5 standard datasets and 2 random datasets were used to test the fuzzy c-means clustering performance with the 7 different distance function. Accuracy, purity, and Rand Index also applied to measure the quality of the resulted cluster. The observation result depicted that the distance function which resulted in the best quality of clusters are Euclidean, Average, Manhattan, Minkowski, Minkowski-Chebisev, and Canberra distance. These 6 distances were able to fulfill the basic hypothesis of the objective function behavior on Fuzzy C-Means Clustering method. The only distance who were not able to fulfill the basic hypothesis is Chebisev distance

    Pengantar Statistika Matematika 2

    Get PDF

    Implementation of Takagi Sugeno Kang Fuzzy with Rough Set Theory And Mini-Batch Gradient Descent Uniform Regularization

    Get PDF
    The Takagi Sugeno Kang (TSK) fuzzy approach is popular since its output is either a constant or a function. Parameter identification and structure identification are the two key requirements for building the TSK fuzzy system. The input utilized in fuzzy TSK can have an impact on the number of rules produced in such a way that employing more data dimensions typically results in more rules, which causes rule complexity. This issue can be solved by employing a dimension reduction technique that reduces the number of dimensions in the data. After that, the resulting rules are improved with MBGD (Mini-Batch Gradient Descent), which is then altered with uniform regularization (UR). UR can enhance the classifier's fuzzy TSK generalization performance. This study looks at how the rough sets method can be used to reduce data dimensions and use Mini Batch Gradient Descent Uniform Regularization (MBGD-UR) to optimize the rules that come from TSK. 252 respondents' body fat data were utilized as the input, and the mean absolute percentage error (MAPE) was used to analyze the results. Jupyter Notebook software and the Python programming language are used for data processing. The analysis revealed that the MAPE value was 37%, falling into the moderate are

    An epidemic model with viral mutations and vaccine interventions

    Get PDF
    Prediction is a means of forecasting a future value by using and analyzing historical or current data. A popular neural network architecture used as a prediction model is the Recurrent Neural Network (RNN) because of its wide application and very high generalization performance. This study aims to improve the RNN prediction model’s accuracy using k-means grouping and PCA dimension reduction methods by comparing the five distance functions. Data were processed using Python software and the results obtained from the PCA calculation yielded three new variables or principal components out of the five examined. This study used an optimized RNN prediction model with k-means clustering by comparing the Euclidean, Manhattan, Canberra, Average, and Chebyshev distance functions as a measure of data grouping similarity to avoid being trapped in the local optimal solution. In addition, PCA dimension reduction was also used in facilitating multivariate data analysis. The k-means grouping showed that the most optimal distance is the average function producing a DBI value of 0.60855 and converging at the 9th iteration. The RNN prediction model results evaluated based on the number of RMSE errors which was 0.83, while that of MAPE was 8.62%. Therefore, it was concluded that the K-means and PCA methods generated a more optimal prediction model for the RNN metho

    Optimization of markov weighted fuzzy time series forecasting using genetic algorithm (GA) AND particle Swarm Optimization (PSO)

    Get PDF
    The Markov Weighted Fuzzy Time Series (MWFTS) is a method for making predictions based on developing a fuzzy time series (FTS) algorithm. The MWTS has overcome certain limitations of FTS, such as repetition of fuzzy logic relationships and weight considerations of fuzzy logic relationships. The main challenge of the MWFTS method is the absence of standardized rules for determining partition intervals. This study compares the MWFTS model to the partition methods Genetic Algorithm-Fuzzy K-Medoids clustering (GA-FKM) and Fuzzy K-Medoids clusteringParticle Swarm Optimization (FKM-PSO) to solve the problem of determining the partition interval and develop an algorithm. Optimal partition optimization. The GA optimization algorithm’s performance on GA-FKM depends on optimizing the clustering of FKM to obtain the most significant partition interval. Implementing the PSO optimization algorithm on FKM-PSO involves maximizing the interval length following the FKM procedure. The proposed method was applied to Anand Vihar, India’s air quality data. The MWFTS method combined with the GA-FKM partitioning method reduced the mean absolute square error (MAPE) from 17.440 to 16.85%. While the results of forecasting using the MWFTS method in conjunction with the FKM-PSO partition method were able to reduce the MAPE percentage from 9.78% to 7.58%, the MAPE percentage was still 9.78%. Initially, the root mean square error (RMSE) score for the GA-FKM partitioning technique was 48,179 to 47,01. After applying the FKM-PSO method, the initial RMSE score of 30,638 was reduced to 24,863

    Fuzzy Support Vector Machine Using Function Linear Membership and Exponential with Mahanalobis Distance

    Get PDF
    Support vector machine (SVM) is one of effective biner classification technic with structural risk minimization (SRM) principle. SVM method is known as one of successful method in classification technic. But the real-life data problem lies in the occurrence of noise and outlier. Noise will create confusion for the SVM when the data is being processed. On this research, SVM is being developed by adding its fuzzy membership function to lessen the noise and outlier effect in data when trying to figure out the hyperplane solution. Distance calculation is also being considered while determining fuzzy value because it is a basic thing in determining the proximity between data elements, which in general is built depending on the distance between the point into the real class mass center. Fuzzy support vector machine (FSVM) uses Mahalanobis distances with the goal of finding the best hyperplane by separating data between defined classes. The data used will be going over trial for several dividing partition percentage transforming into training set and testing set. Although theoretically FSVM is able to overcome noise and outliers, the results show that the accuracy of FSVM, namely 0.017170689 and 0.018668421, is lower than the accuracy of the classical SVM method, which is 0.018838348. The existence of fuzzy membership function is extremely influential in deciding the best hyperplane. Based on that, determining the correct fuzzy membership is critical in FSVM problem

    Korespondensi

    Get PDF

    Developing an optimized recurrent neural network model for air quality prediction using K-Means clustering and PCS dimension reduction

    Get PDF
    Prediction is a means of forecasting a future value by using and analyzing historical or current data. A popular neural network architecture used as a prediction model is the Recurrent Neural Network (RNN) because of its wide application and very high generalization performance. This study aims to improve the RNN prediction model’s accuracy using k-means grouping and PCA dimension reduction methods by comparing the five distance functions. Data were processed using Python software and the results obtained from the PCA calculation yielded three new variables or principal components out of the five examined. This study used an optimized RNN prediction model with k-means clustering by comparing the Euclidean, Manhattan, Canberra, Average, and Chebyshev distance functions as a measure of data grouping similarity to avoid being trapped in the local optimal solution. In addition, PCA dimension reduction was also used in facilitating multivariate data analysis. The k-means grouping showed that the most optimal distance is the average function producing a DBI value of 0.60855 and converging at the 9th iteration. The RNN prediction model results evaluated based on the number of RMSE errors which was 0.83, while that of MAPE was 8.62%. Therefore, it was concluded that the K-means and PCA methods generated a more optimal prediction model for the RNN method

    Cek Similarity Alternative solution optimization quadrati form with fuzzy number parametrs

    Get PDF
    The fuzzy number is one of the alternatives to maximize solving a quadratic model programming problem. Based on that statement, this paper provides one of the methods to solve the quadratic model programming problem. It started with a general discussion on quadratic model programming and continued by transposing a basic form into a fuzzy quadratic programming equation and giving a reference to solve that problem. Finally, a few examples are provided to analyse how accurate this method works
    corecore